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Activities  and  Findings 


The  project  has  focused  on  validating  the  theoretical  results  of  characterizing  multi-scaling  processes 
in  geomorphology  using  the  Maximum  Entropy  Principle  (MaxEnt)  and  developing  a  model  of  surface 
heat  fluxes  using  the  Principle  of  Maximum  Entropy  Production  (MEP). 


1 .  Validating  MaxEnt  Solution  of  Power-law  Distribution 

The  power-law  distribution  of  drainage  area  has  been  derived  as  the  maximum  entropy  distribution 
under  the  constraint  of  given  geometric  mean.  According  to  the  Maximum  Entropy  theory,  the  power- 
law  index,  the  parameter  in  the  power-law  distribution,  is  related  to  the  geometric-mean  of  the  drainage 
areas  of  a  river  basin  and  the  lower  limit  of  the  drainage  area  that  can  be  derived  from  digital  elevation 
map  (DEM).  The  agreement  between  the  predicted  values  of  the  power-law  index  according  to  the 
MaxEnt  with  those  obtained  from  the  data  as  the  curve-fitting  parameters  is  an  indicator  that  the 
MaxEnt  can  be  used  a  general  organizing  principle  in  understanding  and  characterizing  the  multi¬ 
scaling  processes  in  sampling  network  design. 


It  has  been  known  that  the  drainage  area  for  a  river  network,^,  follows  a  power-law  distribution, 

P(A>a)cca~b  (1) 

where  8  is  the  power-law  index  that  can  be  determined  from  DEM  data.  The  MaxEnt  theory  predicts 
that  8  is  related  to  the  aggregated  properties  of  the  river  network  through, 


-mV 


(2) 


where  is  the  geometric-mean  and  A  /  the  lower-limit  of  the  drainage  areas  of  a  river  network.  Figure  1 
shows  the  DEM  of  Grand  Canyon  from  USGS  data  archive.  The  river  network  is  generated  from  the 
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Figure  1.  A  DEM  map  for  the  Grand  Canyon  from  USGS  DEM  datasets. 


Figure  2.  Computed  cumulative  distribution  of  the  drainage  areas  using  the  DEM  data  shown  in  Figure 
1.  The  predicted  value  of  8=0.317  according  to  Eq  (2),  is  in  good  agreement  with  the  estimated 
8-0.302  obtained  through  curve-fitting  the  straight-line  part  of  the  empirical  CDF. 
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Data  Source 


Predicted  Q  Fitted  Q  %  Diff  Q 


Adjuntas 

0.883 

0.963 

-8.36 

Aguas  Buenas 

0.463 

0.469 

-1.37 

Arecibo 

0.514 

0.512 

0.46 

Baldwyn 

0.466 

0.472 

-1.27 

Barceloneta 

0.466 

0.459 

1.61 

Barranquitas 

[  0.4871 

0.495 

-1.64 

Bayamon 

0.503 

0.491 

2.46 

Bayaney 

0.469 

0.478 

-1.85 

Beavercreek 

0.390 

0.403 

-3.27 

Caguas 

&470l 

0.482 

-2.54 

Camuy 

0.469 

0.486 

-3.54 

Carolina 

0.474 

0.528 

-10.16 

Cayey 

0.488 

0.492 

-0.81 

Central  Aguirre 

0.482 

0.514 

-6.22 

Central  La  Plata 

0.427 

0.421 

1.44 

Table  1.  Comparison  of  the  parameters,  8,  in  the  power-law  distributions  of  drainage  area  (Eq(l))  using 
the  MaxEnt  theory  with  those  estimated  for  the  river  networks  in  Puerto  Rico  derived  from  the  USGS 
DEM  data,  “predicted”  refers  to  the  8  derived  according  to  Eq  (2),  and  “fitted”  to  the  curve-fitting 
values  of  8. 

DEM  data  using  a  commercial  software  RiverTools.  The  computed  cumulative  distribution  ( CDF)  of  A 
is  shown  in  Figure  2.  The  MaxEnt  theory  predicts  8=0.317  according  to  Eq  (2),  while  the  empirical 
value  of  8  obtained  from  the  curve-fitting  of  the  straight  line  CDF  is  0.302,  which  is  in  good  agreement 
with  the  theoretical  value  of  0.317  according  to  the  MaxEnt  theory.  Further  tests  of  Eq  (2)  were  done 
for  several  river  networks  in  Puerto  Rico  using  the  USGS  DEM  datasets.  The  results  are  summarized  in 
Table  1. 

Of  the  fifteen  river  networks,  twelve  estimated  parameters  (“fitted”  in  Table  1)  differ  from  the 
theoretical  values  (“predicted”  in  Table  1)  by  1-3%.  The  largest  difference  is  10%.  It  is  also  found  that 
the  empirical  values  of  8  is  closer  to  the  theoretical  ones  for  larger  river  basins  than  for  smaller  river 
basins.  This  fact  suggests  that  the  some  of  the  differences  are  related  to  the  cut-off  of  the  self-similar 
behavior  of  the  systems  at  very  small  and  very  large  scales  (see  Figure  2).  For  a  wide  range  of  scales  at 
which  the  system  manifests  multi-scaling  or  self-similar  behaviors,  the  MaxEnt  principle  correctly 
predicts  the  statistical  properties  of  the  system. 

The  validation  study  is  the  crucial  first  step  in  the  study  of  Type  I  and  Type  II  multi-scaling  distributions 
for  the  proposed  data-collection  network  design  for  sampling  of  multi-scaling  processes  in  hydro¬ 
meteorology.  The  positive  results  have  provided  convincing  evidence  that  the  MaxEnt  principle  can  be 
used  as  a  unifying  theory  for  characterization  of  self-similar  processes.  This  finding  affirms  the  basic 
ideas  behind  the  project  that  the  MaxEnt  principle  is  able  to  1 )  assign  probability  distributions  to  the 
microscopic  states  of  a  system  based  on  its  macroscopic  properties;  2)  retrieve  useful  information  from 
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incomplete  noisy  data  using  entropy  as  a  consistent  quantitative  measure  of  information.  The  finding 


has  wider  scientific  implications  beyond  our  originally  proposed  study  of  sampling  network  design.  For 
example,  the  analysis  we  did  for  the  Grand  Canyon  and  Puerto  Rico  can  be  done  for  the  Martian 
surface  using  Mars  Orb  iter  Laser  Altimeter  ( MOLA )  data  as  the  topographical  features  of  Martian 
surface  visually  have  some  similarities  with  those  of  the  earth.  By  computing  the  power-law  index,  if 
the  drainage  area  for  a  Martian  “river  network”  indeed  follows  similar  distributions  as  shown  here,  we 
may  be  able  to  answer  the  challenging  question  whether  the  “river  networks”  on  Mars  were  formed  by 
flowing  water  like  on  the  earth.  As  the  most  recent  discovery  of  ice  on  Mars  by  the  Phoenix  Mars 
Mission,  the  MaxEnt  tool  developed  here  may  have  greater  potential  than  we  initially  anticipated. 

2.  Specification  of  Non-informative  Prior  and  the  Information  Entropy 

The  fundamental  idea  of  the  project  is  built  on  the  expression  of  information  entropy,  S,  that  takes  the 
form, 

S=~S f{x)\nf{x)dx,  (3) 

where  f{x)  is  the  probability  density  function  of  a  continuous  variable  x.  This  is  a  special  case  of  the 
general  expression  of  information  entropy  under  the  condition  of  uniform  probability  measure  m(x)=  1 , 

S=~S  f(x)\n^Y^-dx,  (4) 

m  ( x) 

where  m(x)  is  not  constant  in  general.  S  in  Eq  (4)  is  invariant  with  a  change  of  variable,  a  desired 
property  of  information  entropy.  m(x)  is  often  referred  to  as  ignorance  prior  or  non-informative  prior.  It 
must  be  specified  in  order  to  obtain  a  solution  of  probability  distribution  using  the  framework  of 
MaxEnt.  An  unanswered  question  is  what  functions  can  be  used  as  ignorance  priors?  Use  of  the 
uniform  ignorance  prior,  i.e.  m(x)=  1,  has  been  criticized  for:  1)  lacking  invariance  with  change  of 
variable,  and  2)  improper  (non-integrable)  over  an  infinite  domain.  Since  the  formulation  of  the  project 
assumes  uniform  ignorance  prior,  a  justification  is  needed  for  the  purpose  of  this  project  in  particular, 
and  for  answering  the  question  of  ignorance  priors  in  general. 

Investigation  of  the  problem  has  led  to  the  following  conclusions  1 )  Ignorance  prior  is  identical  to  the 
/imiting  density  of  discrete  points  (LDDP)  function,  and  is  not  unique  due  to  unlimited  possibilities  of 
re -parameterization  (or  change  of  variable).  A  LDDP  function  does  not  have  to  be  proper;  2)  The 
Principle  of  Indifference  is  a  logical  and  sufficient  rule  for  selecting  ignorance  priors  applicable  to 
discrete  as  well  as  continuous  parameters.  The  Principle  deprives  any  arbitrariness  and  ambiguity  in 
selecting  ignorance  prior  distribution;  3)  A  probability  distribution  defined  on  continuous  parameter(s) 
consists  of  two  terms:  a  LDDP  function  enumerating  the  hypotheses  that  is  variant  with  re¬ 
parameterization  and  a  probability  function  quantifying  the  likelihood  of  each  hypothesis  that  is 
invariant  with  re-parameterization. 

These  results  justify  the  use  of  uniform  ignorance  prior  in  the  expression  of  information  entropy.  They 
have  important  theoretical  implications.  The  concept  of  the  LDDP  facilitates  the  resolutions  of  some 
puzzles  in  the  probability  theory  such  as  Borel-Kolmogorov  paradox  and  Bertrand's  problem.  These 
results,  if  accepted,  lay  a  sound  foundation  beneath  the  Bayesian  analysis  of  a  physical  system  with 
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incomplete  information  using  the  framework  of  MaxEnt. 


3.  Field  Experiment 

A  field  experiment  at  the  Lucky  Hill  site  (31°  44.564'N,  110°  3.25 l'W)  in  the  Walnut  Gulch  watershed 
was  conducted  2-17  June  2008  using  the  equipment  acquired  under  the  support  of  another  ARO  project 
W91NF-06-1-0224  to  measure  a  full  suite  of  surface  energy  balance  and  soil  variables  including 
downward  and  upward  solar  and  long  wave  radiative  fluxes.  A  eddy-covariance  system  was  deployed  to 
measure  turbulent  and  conductive  heat  fluxes  at  the  surface  together  with  skin  temperature  and  soil 
temperature  at  a  certain  depth.  The  experiment  was  during  the  pre-monsoon  season  characterized  by 
strong  insolation  and  occasional  clouds.  The  experimental  site  is  located  at  a  mostly  bare  spot  over  a 
flat  area.  The  soil  does  not  contain  removable  moisture  from  the  surface  down  to  at  least  tens  of 
centimeters.  The  sampling  period  followed  an  extensive  period  without  rain.  Skin  temperature  was 
measured  using  an  infrared  thermometer.  All  variables  were  sampled  every  10  seconds.  The  collected 
data  (Figures  3-4)  will  be  used  to  understand  the  fundamental  physics  behind  the  partition  of  net 
radiation  into  turbulent  and  conductive  heat  fluxes  from  the  perspective  of  MaxEnt  and  the  maximum 
entropy  production  theory.  The  analysis  of  the  dataset  is  underway. 


Wind  Speed  -  X  direction 


Sampling  Interval  -  5min 


Figure  3.  Wind  velocities  (sampled  at  10  Hz)  measured  by  the  eddy-covariance  device  at  the  Walnut 
Gulch  watershed  during  9-15  June  2008. 
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Energy  Fluxes  using  averaging  time  of  300seconds  -  moving  average  of  20mins 


Figure  4.  Surface  energy  fluxes  (sensible,  ground,  up-  and  down- ward  solar  and  long- wave)  measured 
by  the  eddy-covariance  device  and  radiometers  at  the  Walnut  Gulch  watershed  during  9-15  June  2008. 


4.  Development  of  a  MEP  Model  of  Surface  Heat  Fluxes 

As  a  proof-of-concept  study,  a  model  of  heat  fluxes  with  a  stationary  hypothesis  of  the  energy  balance 
over  a  dry  land  surface  are  proposed  based  on  the  MEP  theory.  The  first  step  is  to  formalute  an  entropy 
production  function  or  dissipation  function  as  a  nonlinear  function  of  the  heat  fluxes  including  “thermal 
inertia”  parameters  to  represent  the  corresponding  transport  mechanism.  The  “thermal  inertia”  for  the 
turbulent  heat  transfer  in  the  atmospheric  boundary  layer  is  parameterized  using  a  new  solution  of  the 
Monin-Obukhov  similarity  model.  A  solution  of  the  heat  fluxes  can  be  obtained  by  finding  the  extreme 
of  the  dissipation  function  under  the  constraint  of  conservation  of  energy  for  a  given  energy  input  (i.e. 
net  radiation)  at  the  surface.  The  MEP  solution  of  the  surface  heat  fluxes  is  tested  using  observations 
from  fields  experiments.  A  sample  result  is  displayed  in  Figure  5  where  the  heat  fluxes  predicted  by  the 
MEP  theory  agree  closely  with  the  observations. 

There  are  several  important  findings  and  conclusions  from  this  study.  The  stationary  hypothesis  adds 
another  example  of  natural  phenomena  understood  from  the  perspective  of  an  optimality  argument.  The 
MEP  method  for  modeling  and  estimating  surface  heat  fluxes  complements  the  existing  methods  as  it 
only  needs  net  radiation  input  without  requiring  other  meteorological  variables  such  as  temperature  and 
wind  speed.  The  MEP  model,  not  replace  the  existing  ones,  offers  an  alternative  method  that  would 
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enhance  the  models  of  land  surface  energy  balance.  The  concept  of  entropy  production  is  not  limited  to 
the  thermodynamic  entropy  production.  Depending  on  the  quantities  of  interest,  the  “entropy 
(production)”  defined  in  the  MaxEnt  and  MEP  formalisms  is  not  always  related  to  the  thermodynamic 
entropy.  This  study  presents  an  example  where  the  “entropy  production”  is  different  from  both 
thermodynamic  entropy  production  and  (thermal)  energy  dissipation  that  are  often  used  in  the  models  of 
non-equilibrium  systems.  The  MEP  method  is  expected  to  be  applicable  to  the  energy  balance  over  a 
wet  or  a  vegetated  land  surface. 


Owens  Lake  data.  MEP  fuxess  (broken),  observed  fluxes  (solid) 


z=2.2m 


QMEP  (W  m"^ 


obs 

given  Rn 


H^W*) 


Figure  5.  (a)  Time-series  of  the  MEP  solution  of  G  and  H  (broken  lines)  vs  the  observed  fluxes  (solid 
lines)  with  the  observed  Rn  as  the  input  to  Eq  (15);  MEP  solution  of  G  (b)  and  H  (c)  vs  observations. 
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20-minute  average  data  collected  at  Owens  lake,  California,  20  June  to  2  July  1993. 

5.  Turbulence  of  Atmospheric  Boundary  Layer 

A  new  solution  of  the  Monin-Obukhove  similarity  equations  for  the  turbulence  in  the  atmospheric 
boundary  layer  has  been  derived  and  tested,  which  plays  a  critical  role  in  the  development  of  MEP 
model  of  surface  heat  fluxes  reported  above.  The  major  findings  include  the  extremum  solution  of  the 
Monin-Obukhov  similarity  equations:  1)  is  the  only  mathematically  consistent  and  physically  realistic 
solution  of  the  M-0  similarity  equations;  2)  has  overcome  some  technical  difficulties  (e.g. 
nonuniqueness  and  non-convergence)  in  applying  the  MOST  in  modeling  of  turbulent  transport  in  the 
ASL;  3)  opens  a  possibility  of  simplifying  the  MOST  formalism  by  replacing  the  two  empirical  stability 
functions  by  some  empirical  constants;  4)  unifies  the  asymptotic  solutions  of  the  ASL  derived  from 
various  arguments;  5)  has  a  solid  foundation  built  on  the  modern  non-equilibrium  thermodynamics;  and 
6)  plays  a  crucial  role  in  successful  modeling  of  the  surface  heat  fluxes  based  on  the  emerging  theory  of 
maximum  entropy  production.  Our  analysis  also  raises  a  doubt  on  the  classical  interpretation  of  the 
Obukhov  length  arguably  due  to  a  mathematical  artifact.  Resolution  of  the  issue  may  offer  new 
opportunities  in  improving  the  atmospheric  turbulence  models. 
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